AITopics | online supplement

Collaborating Authors

online supplement

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Self-Attention as Transport: Limits of Symmetric Spectral Diagnostics

Dahlem, Dominik, Maniloff, Diego, Misiura, Mac

arXiv.org Machine LearningMay-7-2026

Large language models hallucinate in predictable ways: attention routing fails by over-concentrating on a narrow set of positions, or by spreading so diffusely that relevance is diluted, and the shape of the failure carries diagnostic signal. A widely used family of spectral methods analyzes the symmetric component of the degree-normalized attention operator, which governs transport capacity; we prove that every transpose-invariant spectral diagnostic of this operator is structurally orientation-blind (it cannot distinguish an operator from its transpose, and therefore cannot detect information-flow direction), with a quantitative converse establishing the asymmetry coefficient $G$ as the unique control parameter for direction. Pairing this with a closed-form bipartite-Cheeger landscape for canonical causal architectures, we show that uniform causal attention satisfies an $n$-independent floor $ϕ\ge 1/5$ with worst cut at $t^\ast/n \approx 0.32$, while window attention pierces the floor as $O(w/n)$; failure modes are shape-different, not just value-different. The resulting two-axis diagnostic ($ϕ$ for capacity, $G$ for direction) yields a falsifiable polarity prediction: bottleneck- and diffuse-dominated benchmarks should exhibit opposite polarity. Under length-controlled evaluation, transport features retain interpretable signal (LC-AUROC from 0.62 to 0.84) on tested models up to 8B parameters, with polarity reversing as predicted between HaluEval and MedHallu.

large language model, machine learning, natural language, (19 more...)

arXiv.org Machine Learning

2605.04893

Country:

North America > United States (0.46)
Europe (0.28)

Genre:

Research Report > New Finding (0.46)
Research Report > Experimental Study (0.46)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.90)

Add feedback

Personalized Machine Learning: Online Supplement

#artificialintelligenceJun-19-2021, 15:30:30 GMT

The book is currently available in draft form as a downloadable pdf. Every day we interact with machine learning systems that personalize their predictions to individual users, whether to recommend movies, find new friends or dating partners, or organize our news feeds. Such systems involve several modalities of data, ranging from sequences of clicks or purchases, to rich modalities involving text, images, or social interactions. While settings and data modalities vary significantly, in this book we introduce a common set of principles and methods that underpin the design of personalized predictive models. The book begins by revising "traditional" machine learning models, with a special focus on how they should be adapted to settings involving user data.

modality, online supplement, personalized machine learning, (2 more...)

#artificialintelligence

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.08)

Industry: Information Technology (0.45)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Automated extraction of mutual independence patterns using Bayesian comparison of partition models

Marrelec, Guillaume, Giron, Alain

arXiv.org Machine LearningJan-15-2020

Mutual independence is a key concept in statistics that characterizes the structural relationships between variables. Existing methods to investigate mutual independence rely on the definition of two competing models, one being nested into the other and used to generate a null distribution for a statistic of interest, usually under the asymptotic assumption of large sample size. As such, these methods have a very restricted scope of application. In the present manuscript, we propose to change the investigation of mutual independence from a hypothesis-driven task that can only be applied in very specific cases to a blind and automated search within patterns of mutual independence. To this end, we treat the issue as one of model comparison that we solve in a Bayesian framework. We show the relationship between such an approach and existing methods in the case of multivariate normal distributions as well as cross-classified multinomial distributions. We propose a general Markov chain Monte Carlo (MCMC) algorithm to numerically approximate the posterior distribution on the space of all patterns of mutual independence. The relevance of the method is demonstrated on synthetic data as well as two real datasets, showing the unique insight provided by this approach.

independence, partition, probability, (14 more...)

arXiv.org Machine Learning

2001.05407

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Europe > France > Île-de-France > Paris > Paris (0.04)
Europe > Czechia > Prague (0.04)
(5 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Neurology (0.67)
Health & Medicine > Therapeutic Area > Immunology > HIV (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

Detection of Block-Exchangeable Structure in Large-Scale Correlation Matrices

Perreault, Samuel, Duchesne, Thierry, Nešlehová, Johanna G.

arXiv.org Machine LearningNov-22-2017

Correlation matrices are omnipresent in multivariate data analysis. When the number $d$ of variables is large, the sample estimates of correlation matrices are typically noisy and conceal underlying dependence patterns. We consider the case when the variables can be grouped into $K$ clusters with exchangeable dependence; an assumption often made in applications in finance and econometrics. Under this partial exchangeability condition, the corresponding correlation matrix has a block structure and the number of unknown parameters is reduced from $d(d-1)/2$ to at most $K(K+1)/2$. We propose a robust algorithm based on Kendall's rank correlation to identify the clusters without assuming the knowledge of $K$ a priori or anything about the margins except continuity. The corresponding block-structured estimator performs considerably better than the sample Kendall rank correlation matrix when $K < d$. Even in the unstructured case $K = d$, though there is no gain asymptotically, the new estimator can be much more efficient in finite samples. When the data are elliptical, the results extend to linear correlation matrices and their inverses. The procedure is illustrated on financial stock returns.

artificial intelligence, data mining, machine learning, (15 more...)

arXiv.org Machine Learning

1706.0594

Country:

North America > United States (0.67)
North America > Canada > Quebec (0.28)

Genre: Research Report (1.00)

Industry:

Semiconductors & Electronics (1.00)
Leisure & Entertainment (1.00)
Information Technology > Services (1.00)
(5 more...)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.67)

Add feedback

Non-stationary Stochastic Optimization with Local Spatial and Temporal Changes

Chen, Xi, Wang, Yining, Wang, Yu-Xiang

arXiv.org Machine LearningAug-31-2017

We consider a non-stationary sequential stochastic optimization problem, in which the underlying cost functions change over time under a variation budget constraint. We propose an $L_{p,q}$-variation functional to quantify the change, which captures local spatial and temporal variations of the sequence of functions. Under the $L_{p,q}$-variation functional constraint, we derive both upper and matching lower regret bounds for smooth and strongly convex function sequences, which generalize previous results in (Besbes et al., 2015). Our results reveal some surprising phenomena under this general variation functional, such as the curse of dimensionality of the function domain. The key technical novelties in our analysis include an affinity lemma that characterizes the distance of the minimizers of two convex functions with bounded $L_p$ difference, and a cubic spline based construction that attains matching lower bounds.

artificial intelligence, machine learning, optimization, (18 more...)

arXiv.org Machine Learning

1708.0302

Genre: Research Report (0.70)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)

Add feedback